Optimizing Statistical Queries by Exploiting Orthogonality and Interval Properties of Grouping Relations
نویسندگان
چکیده
A statistical query rst manipulates source category data to build a target category in the form of a grouping relation and then performs statistical functions on the associated measurement data. In this paper, the attributes in a grouping relation are partitioned into pair-wise disjoint sets, each called a dimension. A grouping relation is said to be orthogonal if it is equal to the cross product of the projections of itself on all the dimensions. Orthogonality is useful in searching for and using pre-computed summaries on other categories. However, a grouping relation is sometimes not orthogonal, but rather k-partially orthogonal (i.e., the union of k orthogonal ones). It is shown that it is NP-complete to decide if a grouping relation is k-partially orthogonal. The paper then gives an algorithm to derive partial orthogonality. Also investigated in this paper are interval properties of grouping relations useful for optimizing statistical queries. An algorithm is described to derive interval properties.
منابع مشابه
Relational Databases Query Optimization using Hybrid Evolutionary Algorithm
Optimizing the database queries is one of hard research problems. Exhaustive search techniques like dynamic programming is suitable for queries with a few relations, but by increasing the number of relations in query, much use of memory and processing is needed, and the use of these methods is not suitable, so we have to use random and evolutionary methods. The use of evolutionary methods, beca...
متن کاملTopological and Geometrical Reasoning in 3d Grouping for Reconstructing Polyhedral Surfaces
We are developing a system for reconstructing polyhedral surfaces from multiple images. This process can take advantage of the topological relations of the observed image features triggering and therefore speeding up the grouping of features to polyhedral surfaces. Exploiting the statistical properties of features when grouping them leads to consistent decisions being invariant to numbering and...
متن کاملOptimization for Queries with Holistic Functions
The early grouping technique is a new method for optimizing aggregate queries. It provides more opportunities for the query optimizers to find optimal plans because all possible placements of the GROUP BY operators in the query trees are considered during the optimization process. Howeve1; to employ this technique, one of the requirements is that the aggregate function in the query must be dist...
متن کاملINCOMPLETE INTERVAL-VALUED HESITANT FUZZY PREFERENCE RELATIONS IN DECISION MAKING
In this article, we propose a method to deal with incomplete interval-valuedhesitant fuzzy preference relations. For this purpose, an additivetransitivity inspired technique for interval-valued hesitant fuzzypreference relations is formulated which assists in estimating missingpreferences. First of all, we introduce a condition for decision makersproviding incomplete information. Decision maker...
متن کاملOptimizing GPU-accelerated Group-By and Aggregation
The massive parallelism and faster random memory access of Graphics Processing Units (GPUs) promise to further accelerate complex analytics operations such as joins and grouping, but also provide additional challenges to optimizing their performance. There are more implementation alternatives to consider on the GPU, such as exploiting different types of memory on the device and the division of ...
متن کامل